PAGE:



RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/235,416 



DATE: 05/07/1999 
TIME: 11:44:45 



Input Set: 1235416. RAW 



This Raw Listing contains the General Information 
Section and up to first 5 pages. 



1 <110> APPLICANT: Sakowicz, Roman
2 Goldstein, Lawrence S. B.
3 The Regents of the University of California
4 <120> TITLE OF INVENTION: Identification and Expression of a Novel Kinesin Motor
5 Protein
6 <130> FILE REFERENCE: 18557C-000710US
7 <140> CURRENT APPLICATION NUMBER: US/09/235,416
8 <141> CURRENT FILING DATE: 1999-01-22
9 <150> EARLIER APPLICATION NUMBER: WO PCT/US99/01355
10 <151> EARLIER FILING DATE: 1999-01-22
11 <150> EARLIER APPLICATION NUMBER: US 60/072,361
12 <151> EARLIER FILING DATE: 1998-01-23
13 <160> NUMBER OF SEQ ID NOS: 7
14 <170> SOFTWARE: PatentIn Ver. 2.0
15 <210> SEQ ID NO 1
16 <211> LENGTH: 784
17 <212> TYPE: PRT
18 <213> ORGANISM: Thermomyces lanuginosus
19 <220> FEATURE:
20 <223> OTHER INFORMATION: TL-gamma ATP-dependent plus end-directed
21 microtubule motor protein
22 <220> FEATURE:
23 <221> NAME/KEY: DOMAIN
24 <222> LOCATION: (1)..(357)
25 <223> OTHER INFORMATION: kinesin-like microtubule motor domain
26 <220> FEATURE:
27 <221> NAME/KEY: DOMAIN
28 <222> LOCATION: (358)..(442)
29 <223> OTHER INFORMATION: neck domain links motor domain to stalk domain
30 <220> FEATURE:
31 <221> NAME/KEY: DOMAIN
32 <222> LOCATION: (443)..(601)
33 <223> OTHER INFORMATION: stalk domain, unc-104 family domain
34 <220> FEATURE:
35 <221> NAME/KEY: DOMAIN
36 <222> LOCATION: (602)..(784)
37 <223> OTHER INFORMATION: tail domain
38 <220> FEATURE:
39 <221> NAME/KEY: VARIANT
40 <222> LOCATION: (713)
41 <223> OTHER INFORMATION: polymorphic variant
42 <220> FEATURE:
43 <221> NAME/KEY: VARIANT
44 <222> LOCATION: (762)

45 <223> OTHER INFORMATION: polymorphic variant #2 (Asp -> Glu)

46 <220> FEATURE:

47 <221> NAME/KEY: VARIANT

48 <222> LOCATION: (774)

49 <223> OTHER INFORMATION: polymorphic variant #3 (Glu -> Asp)

50 <400> SEQUENCE: 1 

51 Met Ser Gly Gly Gly Asn Ile Lys Val Val Val Arg Val Arg Pro Phe

52 1 5 10 15 

53 Asn Ala Arg Glu Ile Asp Arg Gly Ala Lys Cys Ile Val Arg Met Glu

54 20 25 30 

55 Gly Asn Gln Thr Ile Leu Thr Pro Pro Pro Gly Ala Glu Glu Lys Ala

56 35 40 45 

57 Arg Lys Ser Gly Lys Thr Ile Met Asp Gly Pro Lys Ala Phe Ala Phe

58 50 55 60 

59 Asp Arg Ser Tyr Trp Ser Phe Asp Lys Asn Ala Pro Asn Tyr Ala Arg 

60 65 70 75 80 

61 Gln Glu Asp Leu Phe Gln Asp Leu Gly Val Pro Leu Leu Asp Asn Ala

62 85 90 95 

63 Phe Lys Gly Tyr Asn Asn Cys Ile Phe Ala Tyr Gly Gln Thr Gly Ser

64 100 105 110 

65 Gly Lys Ser Tyr Ser Met Met Gly Tyr Gly Lys Glu His Gly Val Ile

66 115 120 125 

67 Pro Arg Ile Cys Gln Asp Met Phe Arg Arg Ile Asn Glu Leu Gln Lys

68 130 135 140 

69 Asp Lys Asn Leu Thr Cys Thr Val Glu Val Ser Tyr Leu Glu Ile Tyr

70 145 150 155 160 

71 Asn Glu Arg Val Arg Asp Leu Leu Asn Pro Ser Thr Lys Gly Asn Leu 

72 165 170 175 

73 Lys Val Arg Glu His Pro Ser Thr Gly Pro Tyr Val Glu Asp Leu Ala 

74 180 185 190 

75 Lys Leu Val Val Arg Ser Phe Gln Glu Ile Glu Asn Leu Met Asp Glu

76 195 200 205 

77 Gly Asn Lys Ala Arg Thr Val Ala Ala Thr Asn Met Asn Glu Thr Ser 

78 210 215 220 

79 Ser Arg Ser His Ala Val Phe Thr Leu Thr Leu Thr Gln Lys Trp His

80 225 230 235 240 

81 Asp Glu Glu Thr Lys Met Asp Thr Glu Lys Val Ala Lys Ile Ser Leu

82 245 250 255 

83 Val Asp Leu Ala Gly Ser Glu Arg Ala Thr Ser Thr Gly Ala Thr Gly 

84 260 265 270 

85 Ala Arg Leu Lys Glu Gly Ala Glu Ile Asn Arg Ser Leu Ser Thr Leu

86 275 280 285 

87 Gly Arg Val Ile Ala Ala Leu Ala Asp Met Ser Ser Gly Lys Gln Lys

88 290 295 300 

89 Lys Asn Gln Leu Val Pro Tyr Arg Asp Ser Val Leu Thr Trp Leu Leu

90 305 310 315 320 

91 Lys Asp Ser Leu Gly Gly Asn Ser Met Thr Ala Met Ile Ala Ala Ile

92 325 330 335 

93 Ser Pro Ala Asp Ile Asn Phe Glu Glu Thr Leu Ser Thr Leu Arg Tyr

94 340 345 350 



95 Ala Asp Ser Ala Lys Arg Ile Lys Asn His Ala Val Val Asn Glu Asp
96 355 360 365
97 Pro Asn Ala Arg Met Ile Arg Glu Leu Lys Glu Glu Leu Ala Gln Leu
98 370 375 380
99 Arg Ser Lys Leu Gln Ser Ser Gly Gly Gly Gly Gly Gly Ala Gly Gly
100 385 390 395 400
101 Ser Gly Gly Pro Val Glu Glu Ser Tyr Pro Pro Asp Thr Pro Leu Glu
102 405 410 415
103 Lys Gln Ile Val Ser Ile Gln Gln Pro Asp Ala Thr Val Lys Lys Met
104 420 425 430
105 Ser Lys Ala Glu Ile Val Glu Gln Leu Asn Gln Ser Glu Lys Leu Tyr
106 435 440 445
107 Arg Asp Leu Asn Gln Thr Trp Glu Glu Lys Leu Ala Lys Thr Glu Glu
108 450 455 460
109 Ile His Lys Glu Arg Glu Ala Ala Leu Glu Glu Leu Gly Ile Ser Ile
110 465 470 475 480
111 Glu Lys Gly Phe Val Gly Pro Tyr His Ser Lys Glu Met Pro His Leu
112 485 490 495
113 Val Asn Leu Ser Asp Asp Pro Leu Leu Ala Glu Cys Leu Val Tyr Asn
114 500 505 510
115 Ile Lys Pro Gly Gln Thr Arg Val Gly Asn Val Asn Gln Asp Thr Gln
116 515 520 525
117 Ala Glu Ile Arg Leu Asn Gly Ser Lys Ile Leu Lys Glu His Cys Thr
118 530 535 540
119 Phe Glu Asn Val Asp Asn Val Val Thr Ile Val Pro Asn Glu Lys Ala
120 545 550 555 560
121 Ala Val Met Val Asn Gly Val Arg Ile Asp Lys Pro Thr Arg Leu Arg
122 565 570 575
123 Ser Gly Tyr Arg Ile Ile Leu Gly Asp Phe His Ile Phe Arg Phe Asn
124 580 585 590
125 His Pro Glu Glu Ala Arg Ala Glu Arg Gln Glu Gln Ser Leu Leu Arg
126 595 600 605
127 His Ser Val Thr Asn Ser Gln Leu Gly Ser Pro Ala Pro Gly Arg His
128 610 615 620
129 Asp Arg Thr Leu Ser Lys Ala Gly Ser Asp Ala Asp Gly Asp Ser Arg
130 625 630 635 640
131 Ser Asp Ser Pro Leu Pro His Phe Arg Gly Lys Asp Ser Asp Trp Phe
132 645 650 655
133 Tyr Ala Arg Arg Glu Ala Ala Ser Ala Ile Leu Gly Leu Asp Gln Lys
134 660 665 670
135 Ile Ser His Leu Thr Asp Asp Glu Leu Asp Ala Leu Phe Asp Asp Val
136 675 680 685
137 Gln Lys Ala Arg Ala Val Arg Arg Gly Leu Val Glu Asp Asn Glu Asp
138 690 695 700
139 Ser Asp Ser Gln Ser Ser Phe Pro Val Arg Asp Lys Tyr Met Ser Asn
140 705 710 715 720
141 Gly Thr Ile Asp Asn Phe Ser Leu Asp Thr Ala Ile Thr Met Pro Gly
142 725 730 735
143 Thr Pro Arg Ser Asp Asp Asp Gly Asp Ala Leu Phe Phe Gly Asp Lys
144 740 745 750

145 Lys Ser Lys Gln Asp Ala Ser Asn Val Asp Val Glu Glu Leu Arg Gln

146 755 760 765

147 Gln Gln Ala Gln Met Glu Glu Ala Leu Lys Thr Ala Lys Gln Glu Phe

148 770 775 780

149 <210> SEQ ID NO 2

150 <211> LENGTH: 2352

151 <212> TYPE: DNA

152 <213> ORGANISM: Thermomyces lanuginosus

153 <220> FEATURE:

154 <223> OTHER INFORMATION: TL-gamma ATP-dependent plus end-directed 

155 microtubule motor protein 

156 <400> SEQUENCE: 2 

157 atgtcgggcg gtggaaatat caaggtggtg gtgcgggtac gcccgttcaa cgcccgagaa 60 

158 atcgaccgtg gcgcaaaatg tattgtgcgg atggaaggaa atcaaaccat cctcacccct 120 

159 cctccgggtg ccgaagagaa ggcgcgtaaa agtggcaaaa ctattatgga tggcccgaag 180 

160 gcatttgcgt tcgatcggtc gtattggtcc tttgacaaga atgctcccaa ctatgcgaga 240 

161 caggaagacc tattccaaga tctcggagtc ccgcttctgg ataatgcatt caagggttat 300 

162 aacaattgta tcttcgccta cggtcagacc ggttcgggca agtcctattc aatgatgggc 360

163 tatggcaagg agcatggcgt gatcccgcgg atttgccagg acatgttccg gcgtattaat 420 

164 gaactgcaga aggacaagaa cctcacttgc accgtcgaag tttcgtactt ggaaatttac 480 

165 aatgaacgag tgcgagactt gctgaatccg tcgacaaagg ggaatctcaa ggtccgagaa 540 

166 cacccgtcga ccggccccta cgtggaggac ttggcgaagc tggtcgtgcg atcattccaa 600 

167 gaaatcgaaa atctcatgga tgagggcaac aaagccagaa cggttgccgc cacaaacatg 660 

168 aacgagacat ccagtcgatc ccacgccgtc ttcactttga ccttgacgca aaagtggcat 720 

169 gatgaagaga ccaaaatgga cacagagaag gttgcgaaga tcagtctggt agatttggcg 780 

170 ggttctgagc gagcaacgtc caccggagct actggagcgc gactgaagga gggtgcagag 840 

171 atcaaccgct cactttcgac cctaggtcgt gtgattgcag cgctagcgga tatgtcgtcg 900 

172 ggaaaacaga agaagaatca gttagtacct taccgagatt cggtactgac gtggcttctg 960 

173 aaggactcct tgggaggcaa ctcgatgacc gccatgattg ccgccatttc gcctgctgat 1020

174 attaactttg aagagactct cagtaccctt cgatatgcgg actctgcgaa gcgaatcaag 1080 

175 aaccacgcag tggtcaatga agacccgaac gcgcggatga tccgcgagtt gaaggaggaa 1140 

176 ctcgcgcagc tgaggagcaa actccagagc agtggtggag gtggaggtgg tgcaggaggt 1200

177 tctggcgggc cagtggagga atcgtacccg cccgacacgc cgctcgagaa gcaaatcgtg 1260 

178 tcgattcagc agccggatgc gacagtcaag aaaatgagca aggcagaaat cgtggagcaa 1320 

179 ctgaaccaga gtgagaagct ctatcgggat ctcaatcaga cctgggaaga gaagctggcc 1380 

180 aagaccgagg aaatccacaa ggaacgagaa gcggcgctcg aggagctggg tatcagcatc 1440

181 gaaaagggct ttgttggccc ttaccactcc aaagaaatgc cacatctagt caacttgagc 1500 

182 gatgatcctc ttctggctga gtgtcttgtc tacaacatca agcccgggca gacaagggtt 1560 

183 ggaaacgtca accaagatac acaagcggaa attcgtctga acggttcgaa gatcctgaaa 1620

184 gaacactgta cgtttgaaaa tgtggacaac gttgtgacca tcgtgccaaa cgagaaggct 1680 

185 gctgtcatgg tgaacggcgt gcgaatcgac aagcctactc gcctccgcag cggctacagg 1740 

186 atcatcctgg gcgatttcca catttttcga ttcaaccatc cggaagaagc tcgtgcggaa 1800 

187 cggcaagaac aatccttgct tcgccattct gtcaccaaca gtcagttggg ttcgcctgct 1860 

188 ccaggccgtc acgaccggac actgagcaag gcgggttcgg atgcggacgg cgattctcgc 1920

189 tcagattctc ctttgccgca ctttcgtgga aaggatagcg actggttcta tgctcgcagg 1980 

190 gaagctgcta gcgcgatcct agggttggat cagaagatct ctcatctgac agatgacgag 2040

191 ttggatgcat tatttgacga tgttcagaaa gcgcgggcag ttcgtcgtgg gctggtcgaa 2100 

192 gacaacgaag atagcgattc gcagagttcg tttccggtcc gtgacaaata catgtccaat 2160

193 ggaaccattg ataatttctc gctcgatacc gccattacta tgccgggtac ccctcgtagt 2220

194 gatgacgacg gtgacgcgct gttttttggt gataagaagt cgaaacagga tgcgtctaat 2280 





195 gttgatgttg aggagttgcg tcaacagcag gctcagatgg aagaagccct gaaaacagcg 2340
196 aagcaggaat tc 2352
197 <210> SEQ ID NO 3
198 <211> LENGTH: 21
199 <212> TYPE: DNA
200 <213> ORGANISM: Artificial Sequence
201 <220> FEATURE:
202 <223> OTHER INFORMATION: Description of Artificial Sequence: primer
203 <400> SEQUENCE: 3
204 atgtcgggcg gtggaaatat c 21
205 <210> SEQ ID NO 4
206 <211> LENGTH: 23
207 <212> TYPE: DNA
208 <213> ORGANISM: Artificial Sequence
209 <220> FEATURE:
210 <223> OTHER INFORMATION: Description of Artificial Sequence: primer
211 <400> SEQUENCE: 4
212 gaattcctgc ttcgctgttt tca 23
213 <210> SEQ ID NO 5
214 <211> LENGTH: 30
215 <212> TYPE: DNA
216 <213> ORGANISM: Artificial Sequence
217 <220> FEATURE:
218 <223> OTHER INFORMATION: Description of Artificial Sequence: degenerate
219 forward primer
220 <220> FEATURE:
221 <221> NAME/KEY: modified_base
222 <222> LOCATION: (25)
223 <223> OTHER INFORMATION: n = a, c, g or t
224 <400> SEQUENCE: 5
W--> 225 gcgcggatcc atyttygcht ayggncarac 30
226 <210> SEQ ID NO 6
227 <211> LENGTH: 30
228 <212> TYPE: DNA
229 <213> ORGANISM: Artificial Sequence
230 <220> FEATURE:
231 <223> OTHER INFORMATION: Description of Artificial Sequence: degenerate
232 reverse primer
233 <220> FEATURE:
234 <221> NAME/KEY: modified_base
235 <222> LOCATION: (16)
236 <223> OTHER INFORMATION: n = a, c, g or t
237 <220> FEATURE:
238 <221> NAME/KEY: modified_base
239 <222> LOCATION: (28)
240 <223> OTHER INFORMATION: n = a, c, g or t
241 <400> SEQUENCE: 6
W--> 242 gcgcgaattc tcdganccdg cyarrtcnac 30
243 <210> SEQ ID NO 7
244 <211> LENGTH: 30

Please Note: 





Sequence Listing to ensure that a corresponding explanation is presented in the <220> to
<223> fields of each sequence which presents at least one n or Xaa.
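
The check described in the note above can be illustrated with a short Python sketch. It is only an illustration, not the verification program that produced this listing: the function names ambiguous_positions and undeclared_positions are hypothetical, and the sketch assumes that an explanation counts as "presented" when the 1-based position of each n appears as a LOCATION in a modified_base feature. The sequences and declared locations are the ones given for SEQ ID NO 5 and SEQ ID NO 6 in this listing.

```python
import re


def ambiguous_positions(seq: str) -> list[int]:
    """Return the 1-based positions of 'n' in a nucleotide sequence.

    Whitespace (the 10-base grouping used in the listing) is ignored.
    """
    bases = re.sub(r"\s+", "", seq).lower()
    return [i for i, base in enumerate(bases, start=1) if base == "n"]


def undeclared_positions(seq: str, declared: list[int]) -> list[int]:
    """Return positions of 'n' that have no matching declared LOCATION.

    Assumption: a declared LOCATION is a single 1-based position, as in
    the modified_base features of SEQ ID NO 5 and SEQ ID NO 6.
    """
    return sorted(set(ambiguous_positions(seq)) - set(declared))


# Sequences as printed in the listing (lines 225 and 242).
SEQ5 = "gcgcggatcc atyttygcht ayggncarac"
SEQ6 = "gcgcgaattc tcdganccdg cyarrtcnac"

print(ambiguous_positions(SEQ5))             # [25]
print(ambiguous_positions(SEQ6))             # [16, 28]
print(undeclared_positions(SEQ5, [25]))      # []
print(undeclared_positions(SEQ6, [16, 28]))  # []
```

Under these assumptions both primers come out fully explained: every n sits at a position that a modified_base feature declares, which suggests the W flags on lines 225 and 242 are reminders to review those features and do not indicate a missing explanation.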



VERIFICATION SUMMARY






Line ? Error/Warning Original Text 



225 W "N" or "Xaa" used: Feature required gcgcggatcc atyttygcht ayggncarac 

242 W "N" or "Xaa" used: Feature required gcgcgaattc tcdganccdg cvarrtcnac

259 W "N" or "Xaa" used: Feature required gcgcgaattc tcdctnccdg cvarrtcnac 



